Measuring convergence in language model estimation using relative entropy
Abstract
Language models are generally estimated using smoothed counting techniques. These counting schemes can be viewed as nonlinear functions operating on a Bernoulli process that converge asymptotically to the true density. The rate at which they converge is constrained by the available training data and the nature of the language model (LM) being estimated. In this paper we treat language model estimates as random variables and present an efficient relative entropy (R.E.) based approach to study their convergence with increasing training data size. We present experimental results for language modeling in a generic LVCSR system and a medical domain dialogue task. We also present an efficient recursive R.E. computation method which can be used as an LM distance measure for a number of tasks, including LM clustering.
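To make the idea in the abstract concrete, the following is a minimal Python sketch of how relative entropy can track the convergence of a smoothed counting estimate as the training sample grows. It is not the paper's recursive R.E. computation method. It assumes a toy unigram model with a known "true" distribution and add-k smoothing, and the names `add_k_unigram`, `relative_entropy`, and `convergence_curve` are hypothetical, introduced only for illustration.

```python
import math
import random
from collections import Counter


def add_k_unigram(tokens, vocab, k=1.0):
    """Add-k smoothed unigram estimate over a fixed vocabulary (illustrative assumption)."""
    counts = Counter(tokens)
    total = len(tokens) + k * len(vocab)
    return {w: (counts[w] + k) / total for w in vocab}


def relative_entropy(p, q):
    """Relative entropy D(p || q) in bits; assumes q[w] > 0 wherever p[w] > 0."""
    return sum(pw * math.log2(pw / q[w]) for w, pw in p.items() if pw > 0)


def convergence_curve(true_dist, sizes, seed=0):
    """Sample from a known toy distribution and report D(true || estimate) per training size."""
    rng = random.Random(seed)
    vocab = list(true_dist)
    weights = [true_dist[w] for w in vocab]
    sample = rng.choices(vocab, weights=weights, k=max(sizes))
    # Each estimate uses only the first n tokens, mimicking a growing training set.
    return [(n, relative_entropy(true_dist, add_k_unigram(sample[:n], vocab)))
            for n in sizes]


if __name__ == "__main__":
    # Hypothetical "true" distribution; a real LM's true density is unknown.
    true_dist = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}
    for n, d in convergence_curve(true_dist, [10, 100, 1000, 10000, 100000]):
        print(f"n={n:>6}  D(true || estimate) = {d:.6f} bits")
```

In this toy setting the relative entropy should shrink toward zero as the sample grows. In practice the true density is not available, so one would compare estimates trained on successive data sizes instead.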
Similar resources
Entropy Generation of Variable Viscosity and Thermal Radiation on Magneto Nanofluid Flow with Dusty Fluid
The present work illustrates the variable viscosity of dust nanofluid running over a permeable stretched sheet with thermal radiation. The problem has been modelled mathematically introducing the mixed convective condition and magnetic effect. Additionally, analysis of entropy generation and Bejan number provides the fine points of the flow. The model equations are transformed into non-linear or...
E-Bayesian Approach in A Shrinkage Estimation of Parameter of Inverse Rayleigh Distribution under General Entropy Loss Function
Whenever approximate and initial information about the unknown parameter of a distribution is available, the shrinkage estimation method can be used to estimate it. In this paper, first the $ E $-Bayesian estimation of the parameter of inverse Rayleigh distribution under the general entropy loss function is obtained. Then, the shrinkage estimate of the inverse Rayleigh distribution parameter i...
Estimation of Daily Evaporation Using of Artificial Neural Networks (Case Study; Borujerd Meteorological Station)
Evaporation is one of the most important components of the hydrologic cycle. Accurate estimation of this parameter is used for studies such as water balance, irrigation system design, and water resource management. In order to estimate the evaporation, direct measurement methods or physical and empirical models can be used. Using direct methods requires installing meteorological stations and instruments ...
Measuring System Entropy Generation in a Complex Economic Network (The Case of Iran)
An economic system is comprised of different primary flows that can be captured in macroeconomic models with complex network relations. Theoretically and empirically in this system, weak substitution or complementarity of environmental materials, like energy and other production factors such as capital, is undeniable. This is an effective critique on neoclassical economics. In this paper, we vi...
Relaxation Rate, Diffusion Approximation and Fick’s Law for Inelastic Scattering Boltzmann Models Bertrand Lods
We consider the linear dissipative Boltzmann equation describing inelastic interactions of particles with a fixed background. For the simplified model of Maxwell molecules first, we give a complete spectral analysis, and deduce from it the optimal rate of exponential convergence to equilibrium. Moreover, we show the convergence to the heat equation in the diffusive limit and compute explicitly ...
Publication date: 2004